Bayesian nonparametric modeling for disease incidence data

نویسنده

  • Athanasios Kottas
چکیده

Disease incidence or mortality data are routinely recorded as summary counts for contiguous geographical regions (e.g., census tracts, zip codes, districts, or counties) and collected over discrete time periods. The count responses are typically accompanied by covariate information associated with the region (e.g., median family income, or percent with a specific type of education), and occasionally, by covariate information associated with each incidence case (e.g., sex, race, age), even though we only know the region into which the case falls. A key inferential objective in the analysis of disease incidence data is identification and explanation of spatial and spatio-temporal patterns of disease risk (disease mapping). Also of interest is forecasting of disease risk. The statistical literature of the past twenty five years or so has witnessed a growing emphasis on fairly sophisticated methods to model heterogeneity in disease event rates. Most of the methodology has been developed within a hierarchical framework through introduction of spatial and spatio-temporal models tailored to the disease mapping inference goals. In this context, the Bayesian approach to modeling and inference is naturally attractive. In this chapter, we review Bayesian nonparametric spatial and spatio-temporal modeling approaches for disease incidence data. Section 2 provides the necessary background on nonparametric priors, mainly the Dirichlet process prior, and its extension to spatial Dirichlet process models. Bayesian nonparametric work has focused on modeling methods for the stochastic mechanism that generates the region-specific count responses, and this is where we place the emphasis in this review, providing only brief discussion on modeling the covariate information. We thus focus on distributional specifications for the disease incidence counts (number of observed cases of the disease). These count responses are denoted by yit, where i = 1, ...,m indexes the geographic regions Ai, and t = 1, ..., T indexes the (discrete) time periods. Note that although cases occur at specific spatial point locations, the available responses are associated with entire ∗Athanasios Kottas ([email protected]) is Professor, Department of Applied Mathematics and Statistics, University of California, Santa Cruz, CA, 95064, USA. The author wishes to thank Andrew Lawson and Sudipto Banerjee for the invitation to write this book chapter, as well as a reviewer for useful feedback. This work was supported in part by the National Science Foundation under award DMS 1310438.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Nonparametric Spatio-Temporal Models for Disease Incidence Data

Typically, disease incidence or mortality data are available as rates or counts for specified regions, collected over time. We propose Bayesian nonparametric spatial modeling approaches to analyze such data. We develop a hierarchical specification using spatial random effects modeled with a Dirichlet process prior. The Dirichlet process is centered around a multivariate normal distribution. Thi...

متن کامل

مدل یابی انتشار بیماری های عفونی بر اساس رویکرد آماری بیز

Background and Aim: Health surveillance systems are now paying more attention to infectious diseases, largely because of emerging and re-emerging infections. The main objective of this research is presenting a statistical method for modeling infectious disease incidence based on the Bayesian approach.Material and Methods: Since infectious diseases have two phases, namely epidemic and non-epidem...

متن کامل

Bayesian Nonparametric Modeling and Theory for Complex Data

Bayesian Nonparametric Modeling and Theory for Complex Data

متن کامل

A Disease Outbreak Prediction Model Using Bayesian Inference: A Case of Influenza

Introduction: One major problem in analyzing epidemic data is the lack of data and high dependency among the available data, which is due to the fact that the epidemic process is not directly observable. Methods: One method for epidemic data analysis to estimate the desired epidemic parameters, such as disease transmission rate and recovery rate, is data ...

متن کامل

Assessment of Neonate's Congenital Hypothyroidism Pattern Using Poisson Spatio-temporal Model in Disease Mapping under the Bayesian Paradigm during 2011-18 in Guilan, Iran

Background: Congenital Hypothyroidism (CH) is one of the reasons for mental retardation and defective growth in neonates. It can be treated if it is diagnosed early. The congenital hypothyroidism can be diagnosed using newborn screening in the first days after birth. Disease mapping helps to identify high-risk areas of the disease. This study aimed to evaluate the pattern of CH using the Poisso...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015